Automatic acoustic identification of insects inspired by the speaker recognition paradigm
نویسندگان
چکیده
This work reports our research efforts towards developing efficient equipment for the automatic acoustic recognition of insects. In particular, we discuss the characteristics of the acoustic patterns of a target insect family, namely the cricket family. To address the recognition problem we apply a feature extraction methodology that has been inspired by well documented tactics of speech processing, which were adapted here to the specifics of the sound production mechanism of insects, in combination with state-of-the-art speaker identification technology. We apply this approach to a large and well documented database of families and subfamilies of cricket sounds, and we report results that exceed 99% recognition accuracy on the levels of family and subfamily, and 94% on the level of a specific insect out of 105 species. We deem this equipment will be of practical benefit for non-intrusive acoustic environmental monitoring applications as it is directly expandable to other insect species.
منابع مشابه
Discriminative acoustic language recognition via channel-compensated GMM statistics
We propose a novel design for acoustic feature-based automatic spoken language recognizers. Our design is inspired by recent advances in text-independent speaker recognition, where intraclass variability is modeled by factor analysis in Gaussian mixture model (GMM) space. We use approximations to GMMlikelihoods which allow variable-length data sequences to be represented as statistics of fixed ...
متن کاملEffects of stimulus contents and speaker familiarity on perceptual speaker identification
1. Introduction In daily conversation, listeners use both phonological and speaker information of the utterance in order to achieve successful communication [1]. It is reported that the processing of the phonological content and that of the speaker identity occur separately, although they interact with each other [1,2]. One example of this interaction is that certain sounds are effective for pe...
متن کاملPersian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملFactor analysis based semantic variability compensation for automatic conversation representation
The main objective of this paper is to identify themes from dialogues of telephone conversations in a real-life customer care service. In this task, the word semantic variability contained in these conversations may impact the classification performance by retaining the noise in their vectorial representation. In this article, we propose an original method to compensate this semantic variabilit...
متن کاملDeep Neural Network Bottleneck Features for Acoustic Event Recognition
Bottleneck features have been shown to be effective in improving the accuracy of speaker recognition, language identification and automatic speech recognition. However, few works have focused on bottleneck features for acoustic event recognition. This paper proposes a novel acoustic event recognition framework using bottleneck features derived from a Deep Neural Network (DNN). In addition to co...
متن کامل